Overview
Brought to you by YData
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 105034 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 107 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 16.8 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Categorical | 3 |
|---|---|
| Text | 2 |
| DateTime | 2 |
| Numeric | 14 |
| Dataset has 107 (0.1%) duplicate rows | Duplicates |
Average_Rating is highly overall correlated with Hotel_Name and 5 other fields | High correlation |
Hotel_Name is highly overall correlated with Average_Rating and 8 other fields | High correlation |
Num_of_Ratings is highly overall correlated with Hotel_Name and 1 other fields | High correlation |
cleanliness_score is highly overall correlated with Average_Rating and 6 other fields | High correlation |
comfort_score is highly overall correlated with Average_Rating and 5 other fields | High correlation |
employee_friendliness_score is highly overall correlated with Average_Rating and 5 other fields | High correlation |
facility_score is highly overall correlated with Average_Rating and 6 other fields | High correlation |
hotel_grade is highly overall correlated with Hotel_Name and 3 other fields | High correlation |
location_score is highly overall correlated with Hotel_Name | High correlation |
value_for_money_score is highly overall correlated with Average_Rating and 5 other fields | High correlation |
is_photo is highly imbalanced (71.9%) | Imbalance |
Helpfulness has 95776 (91.2%) zeros | Zeros |
Deviation of star ratings has 3010 (2.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-01-14 13:54:37.274498 |
|---|---|
| Analysis finished | 2025-01-14 13:55:10.207205 |
| Duration | 32.93 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
Hotel_Name
Categorical
High correlation 
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 820.7 KiB |
| blakemore | 4988 |
|---|---|
| park-plaza-county-hall | 4988 |
| lancaster-gate | 4982 |
| thistletower | 4982 |
| milleniumgloucester | 4976 |
| Other values (28) |
Length
| Max length | 35 |
|---|---|
| Median length | 22 |
| Mean length | 15.897671 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | studios2let |
|---|---|
| 2nd row | studios2let |
| 3rd row | studios2let |
| 4th row | studios2let |
| 5th row | studios2let |
Common Values
| Value | Count | Frequency (%) |
| blakemore | 4988 | 4.7% |
| park-plaza-county-hall | 4988 | 4.7% |
| lancaster-gate | 4982 | 4.7% |
| thistletower | 4982 | 4.7% |
| milleniumgloucester | 4976 | 4.7% |
| stgileshotel | 4973 | 4.7% |
| zedwell-trocaderor | 4968 | 4.7% |
| z-trafalgar | 4817 | 4.6% |
| marlin-waterloo | 4423 | 4.2% |
| nyx-hotel-london-by-leonardo-hotels | 3846 | 3.7% |
| Other values (23) | 57091 |
Length
| Value | Count | Frequency (%) |
| blakemore | 4988 | 4.7% |
| park-plaza-county-hall | 4988 | 4.7% |
| lancaster-gate | 4982 | 4.7% |
| thistletower | 4982 | 4.7% |
| milleniumgloucester | 4976 | 4.7% |
| stgileshotel | 4973 | 4.7% |
| zedwell-trocaderor | 4968 | 4.7% |
| z-trafalgar | 4817 | 4.6% |
| marlin-waterloo | 4423 | 4.2% |
| nyx-hotel-london-by-leonardo-hotels | 3846 | 3.7% |
| Other values (23) | 57091 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 178106 | |
| o | 164498 | 9.9% |
| t | 161718 | 9.7% |
| l | 153189 | 9.2% |
| a | 127848 | 7.7% |
| r | 118084 | 7.1% |
| n | 97466 | 5.8% |
| s | 89698 | 5.4% |
| - | 85943 | 5.1% |
| i | 74030 | 4.4% |
| Other values (16) | 419216 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1669796 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 178106 | |
| o | 164498 | 9.9% |
| t | 161718 | 9.7% |
| l | 153189 | 9.2% |
| a | 127848 | 7.7% |
| r | 118084 | 7.1% |
| n | 97466 | 5.8% |
| s | 89698 | 5.4% |
| - | 85943 | 5.1% |
| i | 74030 | 4.4% |
| Other values (16) | 419216 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1669796 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 178106 | |
| o | 164498 | 9.9% |
| t | 161718 | 9.7% |
| l | 153189 | 9.2% |
| a | 127848 | 7.7% |
| r | 118084 | 7.1% |
| n | 97466 | 5.8% |
| s | 89698 | 5.4% |
| - | 85943 | 5.1% |
| i | 74030 | 4.4% |
| Other values (16) | 419216 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1669796 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 178106 | |
| o | 164498 | 9.9% |
| t | 161718 | 9.7% |
| l | 153189 | 9.2% |
| a | 127848 | 7.7% |
| r | 118084 | 7.1% |
| n | 97466 | 5.8% |
| s | 89698 | 5.4% |
| - | 85943 | 5.1% |
| i | 74030 | 4.4% |
| Other values (16) | 419216 |
Review_Text
Text
| Distinct | 93986 |
|---|---|
| Distinct (%) | 89.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 820.7 KiB |
Length
| Max length | 3534 |
|---|---|
| Median length | 1872 |
| Mean length | 205.16283 |
| Min length | 1 |
Unique
| Unique | 93355 ? |
|---|---|
| Unique (%) | 88.9% |
Sample
| 1st row | Perfect location with good connections and shops and pubs |
|---|---|
| 2nd row | The room had everything you needed Near to amenities, was good room for price just needs little updatingThe bed was so hard it felt like sleeping on a hard floor, you had to make sure you had something on your feet as flooring pinched you feet needs changing |
| 3rd row | Conveniently nearby St Pancras, very small but clean and pleasant room first floor with small balcony to street side Interesting areaLuggage service can be improved by offering to lock luggage up instead of it just being put into the hall with all risks on the guests |
| 4th row | Reception staffed 24 hours a dayAll good |
| 5th row | Very convenient to Kings Cross and the cityA little dated could do with a lick of paint |
| Value | Count | Frequency (%) |
| the | 212593 | 5.5% |
| and | 145959 | 3.8% |
| was | 122391 | 3.2% |
| to | 97379 | 2.5% |
| a | 90811 | 2.4% |
| room | 73085 | 1.9% |
| in | 64560 | 1.7% |
| very | 53705 | 1.4% |
| for | 50911 | 1.3% |
| of | 49501 | 1.3% |
| Other values (69191) | 2873059 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3749964 | ||
| e | 2067368 | 9.6% |
| o | 1596721 | 7.4% |
| t | 1544636 | 7.2% |
| a | 1483302 | 6.9% |
| n | 1148704 | 5.3% |
| r | 1050903 | 4.9% |
| i | 1028863 | 4.8% |
| s | 963141 | 4.5% |
| l | 824293 | 3.8% |
| Other values (58) | 6091178 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21549073 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3749964 | ||
| e | 2067368 | 9.6% |
| o | 1596721 | 7.4% |
| t | 1544636 | 7.2% |
| a | 1483302 | 6.9% |
| n | 1148704 | 5.3% |
| r | 1050903 | 4.9% |
| i | 1028863 | 4.8% |
| s | 963141 | 4.5% |
| l | 824293 | 3.8% |
| Other values (58) | 6091178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 21549073 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3749964 | ||
| e | 2067368 | 9.6% |
| o | 1596721 | 7.4% |
| t | 1544636 | 7.2% |
| a | 1483302 | 6.9% |
| n | 1148704 | 5.3% |
| r | 1050903 | 4.9% |
| i | 1028863 | 4.8% |
| s | 963141 | 4.5% |
| l | 824293 | 3.8% |
| Other values (58) | 6091178 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 21549073 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3749964 | ||
| e | 2067368 | 9.6% |
| o | 1596721 | 7.4% |
| t | 1544636 | 7.2% |
| a | 1483302 | 6.9% |
| n | 1148704 | 5.3% |
| r | 1050903 | 4.9% |
| i | 1028863 | 4.8% |
| s | 963141 | 4.5% |
| l | 824293 | 3.8% |
| Other values (58) | 6091178 |
Posted_Date
Date
| Distinct | 1107 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 820.7 KiB |
| Minimum | 2021-12-01 00:00:00 |
|---|---|
| Maximum | 2024-12-13 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Rating
Real number (ℝ)
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.7462888 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 7 |
| median | 8 |
| Q3 | 9 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8802602 |
|---|---|
| Coefficient of variation (CV) | 0.24273045 |
| Kurtosis | 2.2232264 |
| Mean | 7.7462888 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.3300718 |
| Sum | 813623.7 |
| Variance | 3.5353784 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 32154 | |
| 9 | 19382 | |
| 10 | 17620 | |
| 7 | 17356 | |
| 6 | 7226 | 6.9% |
| 5 | 4339 | 4.1% |
| 4 | 2404 | 2.3% |
| 3 | 1822 | 1.7% |
| 1 | 1769 | 1.7% |
| 2 | 890 | 0.8% |
| Other values (15) | 72 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1769 | |
| 2 | 890 | 0.8% |
| 2.5 | 1 | < 0.1% |
| 2.9 | 1 | < 0.1% |
| 3 | 1822 | |
| 3.8 | 1 | < 0.1% |
| 4 | 2404 | |
| 4.6 | 1 | < 0.1% |
| 5 | 4339 | |
| 5.4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 17620 | |
| 9.6 | 15 | < 0.1% |
| 9.2 | 12 | < 0.1% |
| 9 | 19382 | |
| 8.8 | 7 | < 0.1% |
| 8.3 | 8 | < 0.1% |
| 8 | 32154 | |
| 7.9 | 7 | < 0.1% |
| 7.5 | 5 | < 0.1% |
| 7.1 | 3 | < 0.1% |
Average_Rating
Real number (ℝ)
High correlation 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8544776 |
| Minimum | 7 |
|---|---|
| Maximum | 8.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 7.6 |
| median | 7.8 |
| Q3 | 8.2 |
| 95-th percentile | 8.6 |
| Maximum | 8.7 |
| Range | 1.7 |
| Interquartile range (IQR) | 0.6 |
Descriptive statistics
| Standard deviation | 0.43647323 |
|---|---|
| Coefficient of variation (CV) | 0.055569988 |
| Kurtosis | -0.50992273 |
| Mean | 7.8544776 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 0.027029348 |
| Sum | 824987.2 |
| Variance | 0.19050888 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.7 | 23449 | |
| 7.4 | 9944 | |
| 7.8 | 9375 | 8.9% |
| 8.4 | 8663 | 8.2% |
| 7.9 | 8049 | 7.7% |
| 8.3 | 7662 | 7.3% |
| 7 | 6994 | 6.7% |
| 8.6 | 6957 | 6.6% |
| 7.6 | 5548 | 5.3% |
| 8 | 5233 | 5.0% |
| Other values (5) | 13160 |
| Value | Count | Frequency (%) |
| 7 | 6994 | 6.7% |
| 7.1 | 2075 | 2.0% |
| 7.4 | 9944 | |
| 7.5 | 2199 | 2.1% |
| 7.6 | 5548 | 5.3% |
| 7.7 | 23449 | |
| 7.8 | 9375 | 8.9% |
| 7.9 | 8049 | 7.7% |
| 8 | 5233 | 5.0% |
| 8.1 | 4423 | 4.2% |
| Value | Count | Frequency (%) |
| 8.7 | 2607 | 2.5% |
| 8.6 | 6957 | 6.6% |
| 8.4 | 8663 | 8.2% |
| 8.3 | 7662 | 7.3% |
| 8.2 | 1856 | 1.8% |
| 8.1 | 4423 | 4.2% |
| 8 | 5233 | 5.0% |
| 7.9 | 8049 | 7.7% |
| 7.8 | 9375 | 8.9% |
| 7.7 | 23449 |
Num_of_Ratings
Real number (ℝ)
High correlation 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11837.911 |
| Minimum | 5613 |
|---|---|
| Maximum | 39497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 5613 |
|---|---|
| 5-th percentile | 5898 |
| Q1 | 7132 |
| median | 9767 |
| Q3 | 13923 |
| 95-th percentile | 20956 |
| Maximum | 39497 |
| Range | 33884 |
| Interquartile range (IQR) | 6791 |
Descriptive statistics
| Standard deviation | 7251.4155 |
|---|---|
| Coefficient of variation (CV) | 0.61255872 |
| Kurtosis | 7.2443839 |
| Mean | 11837.911 |
| Median Absolute Deviation (MAD) | 3252 |
| Skewness | 2.5857582 |
| Sum | 1.2433831 × 109 |
| Variance | 52583027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12340 | 4988 | 4.7% |
| 11045 | 4988 | 4.7% |
| 20956 | 4982 | 4.7% |
| 14445 | 4982 | 4.7% |
| 15320 | 4976 | 4.7% |
| 14989 | 4973 | 4.7% |
| 39497 | 4968 | 4.7% |
| 13923 | 4817 | 4.6% |
| 10695 | 4423 | 4.2% |
| 9394 | 3846 | 3.7% |
| Other values (24) | 57091 |
| Value | Count | Frequency (%) |
| 5613 | 1903 | |
| 5715 | 1969 | |
| 5898 | 1856 | |
| 5932 | 2263 | |
| 5933 | 2424 | |
| 6120 | 2070 | |
| 6248 | 1756 | |
| 6277 | 2482 | |
| 6335 | 2139 | |
| 6404 | 2021 |
| Value | Count | Frequency (%) |
| 39497 | 4968 | |
| 20956 | 4982 | |
| 15320 | 4976 | |
| 14989 | 4973 | |
| 14445 | 4982 | |
| 13923 | 4817 | |
| 12641 | 3542 | |
| 12340 | 4988 | |
| 11670 | 3478 | |
| 11045 | 4988 |
Helpfulness
Real number (ℝ)
Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10283337 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 95776 |
| Zeros (%) | 91.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.36626649 |
|---|---|
| Coefficient of variation (CV) | 3.5617475 |
| Kurtosis | 60.791416 |
| Mean | 0.10283337 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.3616557 |
| Sum | 10801 |
| Variance | 0.13415114 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95776 | |
| 1 | 8084 | 7.7% |
| 2 | 939 | 0.9% |
| 3 | 164 | 0.2% |
| 4 | 39 | < 0.1% |
| 5 | 20 | < 0.1% |
| 6 | 7 | < 0.1% |
| 10 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 95776 | |
| 1 | 8084 | 7.7% |
| 2 | 939 | 0.9% |
| 3 | 164 | 0.2% |
| 4 | 39 | < 0.1% |
| 5 | 20 | < 0.1% |
| 6 | 7 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 7 | < 0.1% |
| 5 | 20 | < 0.1% |
| 4 | 39 | < 0.1% |
| 3 | 164 | 0.2% |
| 2 | 939 | 0.9% |
| 1 | 8084 |
is_photo
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 820.7 KiB |
| 0 | |
|---|---|
| 1 | 5120 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 99914 | |
| 1 | 5120 | 4.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 99914 | |
| 1 | 5120 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 99914 | |
| 1 | 5120 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 105034 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 99914 | |
| 1 | 5120 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 105034 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 99914 | |
| 1 | 5120 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 105034 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 99914 | |
| 1 | 5120 | 4.9% |
review_title
Text
| Distinct | 53217 |
|---|---|
| Distinct (%) | 50.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 820.7 KiB |
Length
| Max length | 120 |
|---|---|
| Median length | 104 |
| Mean length | 31.249291 |
| Min length | 1 |
Unique
| Unique | 50754 ? |
|---|---|
| Unique (%) | 48.3% |
Sample
| 1st row | Exceptional |
|---|---|
| 2nd row | Very good |
| 3rd row | Convenient location |
| 4th row | Peaceful position in an elegant street close to 3 major stations and the Bloomsbury area |
| 5th row | Great little gem in the city centre |
| Value | Count | Frequency (%) |
| good | 29477 | 5.2% |
| location | 19829 | 3.5% |
| and | 19725 | 3.5% |
| very | 18806 | 3.3% |
| stay | 17170 | 3.0% |
| great | 16637 | 2.9% |
| a | 16358 | 2.9% |
| for | 13780 | 2.4% |
| the | 13679 | 2.4% |
| hotel | 12911 | 2.3% |
| Other values (11270) | 393201 |
Most occurring characters
| Value | Count | Frequency (%) |
| 472368 | ||
| e | 307605 | 9.4% |
| o | 288012 | 8.8% |
| a | 248364 | 7.6% |
| t | 240166 | 7.3% |
| n | 185082 | 5.6% |
| l | 160008 | 4.9% |
| r | 158081 | 4.8% |
| i | 152253 | 4.6% |
| s | 116322 | 3.5% |
| Other values (56) | 953977 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3282238 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 472368 | ||
| e | 307605 | 9.4% |
| o | 288012 | 8.8% |
| a | 248364 | 7.6% |
| t | 240166 | 7.3% |
| n | 185082 | 5.6% |
| l | 160008 | 4.9% |
| r | 158081 | 4.8% |
| i | 152253 | 4.6% |
| s | 116322 | 3.5% |
| Other values (56) | 953977 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3282238 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 472368 | ||
| e | 307605 | 9.4% |
| o | 288012 | 8.8% |
| a | 248364 | 7.6% |
| t | 240166 | 7.3% |
| n | 185082 | 5.6% |
| l | 160008 | 4.9% |
| r | 158081 | 4.8% |
| i | 152253 | 4.6% |
| s | 116322 | 3.5% |
| Other values (56) | 953977 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3282238 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 472368 | ||
| e | 307605 | 9.4% |
| o | 288012 | 8.8% |
| a | 248364 | 7.6% |
| t | 240166 | 7.3% |
| n | 185082 | 5.6% |
| l | 160008 | 4.9% |
| r | 158081 | 4.8% |
| i | 152253 | 4.6% |
| s | 116322 | 3.5% |
| Other values (56) | 953977 |
hotel_grade
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 820.7 KiB |
| 4 | |
|---|---|
| 3 | |
| 5 | |
| 0 | 4968 |
| 2 | 2174 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 4 | 51016 | |
| 3 | 39617 | |
| 5 | 7259 | 6.9% |
| 0 | 4968 | 4.7% |
| 2 | 2174 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4 | 51016 | |
| 3 | 39617 | |
| 5 | 7259 | 6.9% |
| 0 | 4968 | 4.7% |
| 2 | 2174 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 51016 | |
| 3 | 39617 | |
| 5 | 7259 | 6.9% |
| 0 | 4968 | 4.7% |
| 2 | 2174 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 105034 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4 | 51016 | |
| 3 | 39617 | |
| 5 | 7259 | 6.9% |
| 0 | 4968 | 4.7% |
| 2 | 2174 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 105034 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4 | 51016 | |
| 3 | 39617 | |
| 5 | 7259 | 6.9% |
| 0 | 4968 | 4.7% |
| 2 | 2174 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 105034 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4 | 51016 | |
| 3 | 39617 | |
| 5 | 7259 | 6.9% |
| 0 | 4968 | 4.7% |
| 2 | 2174 | 2.1% |
employee_friendliness_score
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.5368243 |
| Minimum | 7.5 |
|---|---|
| Maximum | 9.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 7.5 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 8.3 |
| median | 8.6 |
| Q3 | 8.7 |
| 95-th percentile | 9.1 |
| Maximum | 9.1 |
| Range | 1.6 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.36846077 |
|---|---|
| Coefficient of variation (CV) | 0.04316134 |
| Kurtosis | 0.58257582 |
| Mean | 8.5368243 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | -0.69152495 |
| Sum | 896656.8 |
| Variance | 0.13576334 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.7 | 21523 | |
| 8.6 | 15566 | |
| 8.1 | 9941 | |
| 8.4 | 9534 | |
| 9.1 | 9393 | |
| 8.5 | 9050 | |
| 9 | 8600 | 8.2% |
| 8.3 | 5677 | 5.4% |
| 8 | 4976 | 4.7% |
| 8.8 | 4539 | 4.3% |
| Other values (2) | 6235 | 5.9% |
| Value | Count | Frequency (%) |
| 7.5 | 4096 | 3.9% |
| 8 | 4976 | 4.7% |
| 8.1 | 9941 | |
| 8.2 | 2139 | 2.0% |
| 8.3 | 5677 | 5.4% |
| 8.4 | 9534 | |
| 8.5 | 9050 | |
| 8.6 | 15566 | |
| 8.7 | 21523 | |
| 8.8 | 4539 | 4.3% |
| Value | Count | Frequency (%) |
| 9.1 | 9393 | |
| 9 | 8600 | 8.2% |
| 8.8 | 4539 | 4.3% |
| 8.7 | 21523 | |
| 8.6 | 15566 | |
| 8.5 | 9050 | |
| 8.4 | 9534 | |
| 8.3 | 5677 | 5.4% |
| 8.2 | 2139 | 2.0% |
| 8.1 | 9941 |
facility_score
Real number (ℝ)
High correlation 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8651351 |
| Minimum | 6.9 |
|---|---|
| Maximum | 8.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 6.9 |
|---|---|
| 5-th percentile | 6.9 |
| Q1 | 7.5 |
| median | 7.8 |
| Q3 | 8.3 |
| 95-th percentile | 8.7 |
| Maximum | 8.7 |
| Range | 1.8 |
| Interquartile range (IQR) | 0.8 |
Descriptive statistics
| Standard deviation | 0.50563122 |
|---|---|
| Coefficient of variation (CV) | 0.064287671 |
| Kurtosis | -0.80302545 |
| Mean | 7.8651351 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 0.023523189 |
| Sum | 826106.6 |
| Variance | 0.25566293 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.8 | 12926 | |
| 7.5 | 12729 | |
| 8.7 | 11441 | |
| 7.6 | 9419 | |
| 8.3 | 8112 | 7.7% |
| 6.9 | 6994 | 6.7% |
| 8 | 6854 | 6.5% |
| 7.4 | 4976 | 4.7% |
| 7.2 | 4968 | 4.7% |
| 8.4 | 4817 | 4.6% |
| Other values (7) | 21798 |
| Value | Count | Frequency (%) |
| 6.9 | 6994 | |
| 7.2 | 4968 | 4.7% |
| 7.3 | 2075 | 2.0% |
| 7.4 | 4976 | 4.7% |
| 7.5 | 12729 | |
| 7.6 | 9419 | |
| 7.7 | 4238 | 4.0% |
| 7.8 | 12926 | |
| 7.9 | 2424 | 2.3% |
| 8 | 6854 |
| Value | Count | Frequency (%) |
| 8.7 | 11441 | |
| 8.6 | 1969 | 1.9% |
| 8.5 | 2674 | 2.5% |
| 8.4 | 4817 | 4.6% |
| 8.3 | 8112 | |
| 8.2 | 4423 | 4.2% |
| 8.1 | 3995 | 3.8% |
| 8 | 6854 | |
| 7.9 | 2424 | 2.3% |
| 7.8 | 12926 |
cleanliness_score
Real number (ℝ)
High correlation 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.2577404 |
| Minimum | 7.3 |
|---|---|
| Maximum | 9.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 7.3 |
|---|---|
| 5-th percentile | 7.4 |
| Q1 | 8 |
| median | 8.2 |
| Q3 | 8.7 |
| 95-th percentile | 8.8 |
| Maximum | 9.1 |
| Range | 1.8 |
| Interquartile range (IQR) | 0.7 |
Descriptive statistics
| Standard deviation | 0.43996909 |
|---|---|
| Coefficient of variation (CV) | 0.053279599 |
| Kurtosis | -0.38967786 |
| Mean | 8.2577404 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.20111935 |
| Sum | 867343.5 |
| Variance | 0.1935728 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.7 | 14785 | |
| 8.2 | 11855 | |
| 8.8 | 11508 | |
| 7.9 | 10516 | |
| 8.1 | 9667 | |
| 8.3 | 9060 | |
| 8 | 9046 | |
| 8.4 | 7837 | |
| 7.8 | 4976 | 4.7% |
| 7.3 | 4973 | 4.7% |
| Other values (4) | 10811 |
| Value | Count | Frequency (%) |
| 7.3 | 4973 | |
| 7.4 | 2021 | 1.9% |
| 7.5 | 2075 | 2.0% |
| 7.8 | 4976 | |
| 7.9 | 10516 | |
| 8 | 9046 | |
| 8.1 | 9667 | |
| 8.2 | 11855 | |
| 8.3 | 9060 | |
| 8.4 | 7837 |
| Value | Count | Frequency (%) |
| 9.1 | 4576 | 4.4% |
| 8.8 | 11508 | |
| 8.7 | 14785 | |
| 8.5 | 2139 | 2.0% |
| 8.4 | 7837 | |
| 8.3 | 9060 | |
| 8.2 | 11855 | |
| 8.1 | 9667 | |
| 8 | 9046 | |
| 7.9 | 10516 |
comfort_score
Real number (ℝ)
High correlation 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.2560895 |
| Minimum | 7.3 |
|---|---|
| Maximum | 9.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 7.3 |
|---|---|
| 5-th percentile | 7.3 |
| Q1 | 8 |
| median | 8.2 |
| Q3 | 8.7 |
| 95-th percentile | 8.9 |
| Maximum | 9.1 |
| Range | 1.8 |
| Interquartile range (IQR) | 0.7 |
Descriptive statistics
| Standard deviation | 0.4652842 |
|---|---|
| Coefficient of variation (CV) | 0.056356488 |
| Kurtosis | -0.56329281 |
| Mean | 8.2560895 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.14122156 |
| Sum | 867170.1 |
| Variance | 0.21648939 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 13662 | |
| 7.9 | 12143 | |
| 8.2 | 11161 | |
| 8.8 | 10786 | |
| 8.1 | 9895 | |
| 8.9 | 8834 | |
| 7.3 | 6994 | |
| 8.3 | 6955 | |
| 8.5 | 5616 | 5.3% |
| 8.7 | 4817 | 4.6% |
| Other values (6) | 14171 |
| Value | Count | Frequency (%) |
| 7.3 | 6994 | |
| 7.4 | 2075 | 2.0% |
| 7.8 | 3478 | 3.3% |
| 7.9 | 12143 | |
| 8 | 13662 | |
| 8.1 | 9895 | |
| 8.2 | 11161 | |
| 8.3 | 6955 | |
| 8.4 | 1903 | 1.8% |
| 8.5 | 5616 |
| Value | Count | Frequency (%) |
| 9.1 | 2607 | 2.5% |
| 9 | 1969 | 1.9% |
| 8.9 | 8834 | |
| 8.8 | 10786 | |
| 8.7 | 4817 | |
| 8.6 | 2139 | 2.0% |
| 8.5 | 5616 | |
| 8.4 | 1903 | 1.8% |
| 8.3 | 6955 | |
| 8.2 | 11161 |
value_for_money_score
Real number (ℝ)
High correlation 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.7125198 |
| Minimum | 7 |
|---|---|
| Maximum | 8.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 7.3 |
| Q1 | 7.4 |
| median | 7.7 |
| Q3 | 7.9 |
| 95-th percentile | 8.2 |
| Maximum | 8.3 |
| Range | 1.3 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.32423694 |
|---|---|
| Coefficient of variation (CV) | 0.042040338 |
| Kurtosis | -0.69560853 |
| Mean | 7.7125198 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | -0.18014129 |
| Sum | 810076.8 |
| Variance | 0.10512959 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.9 | 24703 | |
| 7.4 | 14225 | |
| 7.5 | 11117 | |
| 8.1 | 10536 | |
| 7.6 | 10104 | |
| 7.7 | 9196 | 8.8% |
| 7.3 | 7175 | 6.8% |
| 7 | 4973 | 4.7% |
| 8.2 | 4817 | 4.6% |
| 8 | 4363 | 4.2% |
| Value | Count | Frequency (%) |
| 7 | 4973 | 4.7% |
| 7.3 | 7175 | 6.8% |
| 7.4 | 14225 | |
| 7.5 | 11117 | |
| 7.6 | 10104 | |
| 7.7 | 9196 | 8.8% |
| 7.9 | 24703 | |
| 8 | 4363 | 4.2% |
| 8.1 | 10536 | |
| 8.2 | 4817 | 4.6% |
| Value | Count | Frequency (%) |
| 8.3 | 3825 | 3.6% |
| 8.2 | 4817 | 4.6% |
| 8.1 | 10536 | |
| 8 | 4363 | 4.2% |
| 7.9 | 24703 | |
| 7.7 | 9196 | 8.8% |
| 7.6 | 10104 | |
| 7.5 | 11117 | |
| 7.4 | 14225 | |
| 7.3 | 7175 | 6.8% |
location_score
Real number (ℝ)
High correlation 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.1724051 |
| Minimum | 8.2 |
|---|---|
| Maximum | 9.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 8.2 |
|---|---|
| 5-th percentile | 8.6 |
| Q1 | 9 |
| median | 9.1 |
| Q3 | 9.4 |
| 95-th percentile | 9.6 |
| Maximum | 9.7 |
| Range | 1.5 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.31021043 |
|---|---|
| Coefficient of variation (CV) | 0.033819966 |
| Kurtosis | 0.42763495 |
| Mean | 9.1724051 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | -0.49813675 |
| Sum | 963414.4 |
| Variance | 0.096230509 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.9 | 16753 | |
| 9.1 | 15117 | |
| 9 | 14049 | |
| 9.3 | 11747 | |
| 9.4 | 11327 | |
| 9.5 | 9970 | |
| 9.6 | 7575 | |
| 9.2 | 6028 | 5.7% |
| 8.6 | 5576 | 5.3% |
| 9.7 | 4817 | 4.6% |
| Value | Count | Frequency (%) |
| 8.2 | 2075 | 2.0% |
| 8.6 | 5576 | 5.3% |
| 8.9 | 16753 | |
| 9 | 14049 | |
| 9.1 | 15117 | |
| 9.2 | 6028 | 5.7% |
| 9.3 | 11747 | |
| 9.4 | 11327 | |
| 9.5 | 9970 | |
| 9.6 | 7575 |
| Value | Count | Frequency (%) |
| 9.7 | 4817 | 4.6% |
| 9.6 | 7575 | |
| 9.5 | 9970 | |
| 9.4 | 11327 | |
| 9.3 | 11747 | |
| 9.2 | 6028 | 5.7% |
| 9.1 | 15117 | |
| 9 | 14049 | |
| 8.9 | 16753 | |
| 8.6 | 5576 | 5.3% |
Crawled_date
Date
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 820.7 KiB |
| Minimum | 2024-12-02 00:00:00 |
|---|---|
| Maximum | 2024-12-16 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
title_word_count
Real number (ℝ)
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.4815679 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 16 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 5.1027253 |
|---|---|
| Coefficient of variation (CV) | 0.93088792 |
| Kurtosis | 1.7959518 |
| Mean | 5.4815679 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.4452381 |
| Sum | 575751 |
| Variance | 26.037805 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 26575 | |
| 2 | 18035 | |
| 4 | 7451 | 7.1% |
| 5 | 6930 | 6.6% |
| 7 | 6697 | 6.4% |
| 6 | 6636 | 6.3% |
| 3 | 5242 | 5.0% |
| 8 | 4448 | 4.2% |
| 9 | 3844 | 3.7% |
| 10 | 3187 | 3.0% |
| Other values (20) | 15989 |
| Value | Count | Frequency (%) |
| 1 | 26575 | |
| 2 | 18035 | |
| 3 | 5242 | 5.0% |
| 4 | 7451 | 7.1% |
| 5 | 6930 | 6.6% |
| 6 | 6636 | 6.3% |
| 7 | 6697 | 6.4% |
| 8 | 4448 | 4.2% |
| 9 | 3844 | 3.7% |
| 10 | 3187 | 3.0% |
| Value | Count | Frequency (%) |
| 31 | 1 | < 0.1% |
| 29 | 2 | < 0.1% |
| 28 | 7 | < 0.1% |
| 27 | 27 | < 0.1% |
| 26 | 48 | < 0.1% |
| 25 | 131 | 0.1% |
| 24 | 253 | 0.2% |
| 23 | 722 | |
| 22 | 402 | |
| 21 | 558 |
text_word_count
Real number (ℝ)
| Distinct | 419 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.732096 |
| Minimum | 1 |
|---|---|
| Maximum | 666 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 24 |
| Q3 | 48 |
| 95-th percentile | 112 |
| Maximum | 666 |
| Range | 665 |
| Interquartile range (IQR) | 37 |
Descriptive statistics
| Standard deviation | 40.637592 |
|---|---|
| Coefficient of variation (CV) | 1.1063238 |
| Kurtosis | 15.930849 |
| Mean | 36.732096 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 3.0589415 |
| Sum | 3858119 |
| Variance | 1651.4138 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 3087 | 2.9% |
| 7 | 3043 | 2.9% |
| 6 | 2975 | 2.8% |
| 11 | 2826 | 2.7% |
| 9 | 2714 | 2.6% |
| 4 | 2681 | 2.6% |
| 8 | 2598 | 2.5% |
| 12 | 2559 | 2.4% |
| 10 | 2470 | 2.4% |
| 14 | 2388 | 2.3% |
| Other values (409) | 77693 |
| Value | Count | Frequency (%) |
| 1 | 1590 | |
| 2 | 2066 | |
| 3 | 2133 | |
| 4 | 2681 | |
| 5 | 3087 | |
| 6 | 2975 | |
| 7 | 3043 | |
| 8 | 2598 | |
| 9 | 2714 | |
| 10 | 2470 |
| Value | Count | Frequency (%) |
| 666 | 1 | |
| 568 | 1 | |
| 527 | 1 | |
| 510 | 1 | |
| 503 | 1 | |
| 493 | 1 | |
| 491 | 1 | |
| 479 | 1 | |
| 471 | 1 | |
| 469 | 1 |
Deviation of star ratings
Real number (ℝ)
Zeros 
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3003513 |
| Minimum | 0 |
|---|---|
| Maximum | 7.7 |
| Zeros | 3010 |
| Zeros (%) | 2.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1 |
| Q1 | 0.4 |
| median | 1 |
| Q3 | 1.7 |
| 95-th percentile | 3.9 |
| Maximum | 7.7 |
| Range | 7.7 |
| Interquartile range (IQR) | 1.3 |
Descriptive statistics
| Standard deviation | 1.2705356 |
|---|---|
| Coefficient of variation (CV) | 0.97707101 |
| Kurtosis | 5.5055816 |
| Mean | 1.3003513 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | 2.126962 |
| Sum | 136581.1 |
| Variance | 1.6142606 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6 | 9684 | 9.2% |
| 0.4 | 8717 | 8.3% |
| 0.3 | 7140 | 6.8% |
| 1.4 | 6464 | 6.2% |
| 1 | 4595 | 4.4% |
| 1.3 | 4584 | 4.4% |
| 0.7 | 4302 | 4.1% |
| 0.1 | 4274 | 4.1% |
| 1.6 | 3310 | 3.2% |
| 0.3 | 3235 | 3.1% |
| Other values (92) | 48729 |
| Value | Count | Frequency (%) |
| 0 | 3010 | 2.9% |
| 0.1 | 4274 | |
| 0.2 | 612 | 0.6% |
| 0.2 | 2678 | 2.5% |
| 0.3 | 7140 | |
| 0.3 | 3235 | 3.1% |
| 0.4 | 8717 | |
| 0.5 | 945 | 0.9% |
| 0.6 | 9684 | |
| 0.6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 7.7 | 17 | < 0.1% |
| 7.6 | 18 | < 0.1% |
| 7.4 | 105 | 0.1% |
| 7.3 | 60 | 0.1% |
| 7.2 | 9 | < 0.1% |
| 7.1 | 71 | 0.1% |
| 7 | 92 | 0.1% |
| 6.9 | 125 | 0.1% |
| 6.8 | 169 | 0.2% |
| 6.7 | 490 |
Time_lapsed
Real number (ℝ)
| Distinct | 1100 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 505.35697 |
| Minimum | -2 |
|---|---|
| Maximum | 1097 |
| Zeros | 73 |
| Zeros (%) | 0.1% |
| Negative | 29 |
| Negative (%) | < 0.1% |
| Memory size | 820.7 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 199 |
| median | 504 |
| Q3 | 804 |
| 95-th percentile | 1018 |
| Maximum | 1097 |
| Range | 1099 |
| Interquartile range (IQR) | 605 |
Descriptive statistics
| Standard deviation | 330.30957 |
|---|---|
| Coefficient of variation (CV) | 0.65361634 |
| Kurtosis | -1.2577181 |
| Mean | 505.35697 |
| Median Absolute Deviation (MAD) | 303 |
| Skewness | 0.06327195 |
| Sum | 53079664 |
| Variance | 109104.41 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22 | 1141 | 1.1% |
| 33 | 935 | 0.9% |
| 34 | 909 | 0.9% |
| 41 | 901 | 0.9% |
| 39 | 874 | 0.8% |
| 21 | 752 | 0.7% |
| 35 | 705 | 0.7% |
| 42 | 519 | 0.5% |
| 38 | 497 | 0.5% |
| 40 | 448 | 0.4% |
| Other values (1090) | 97353 |
| Value | Count | Frequency (%) |
| -2 | 2 | < 0.1% |
| -1 | 27 | < 0.1% |
| 0 | 73 | |
| 1 | 72 | |
| 2 | 103 | |
| 3 | 102 | |
| 4 | 81 | |
| 5 | 59 | |
| 6 | 83 | |
| 7 | 122 |
| Value | Count | Frequency (%) |
| 1097 | 3 | < 0.1% |
| 1096 | 42 | < 0.1% |
| 1095 | 54 | 0.1% |
| 1094 | 59 | 0.1% |
| 1093 | 80 | |
| 1092 | 158 | |
| 1091 | 91 | |
| 1090 | 81 | |
| 1089 | 80 | |
| 1088 | 70 |
Interactions
Correlations
| Average_Rating | Deviation of star ratings | Helpfulness | Hotel_Name | Num_of_Ratings | Rating | Time_lapsed | cleanliness_score | comfort_score | employee_friendliness_score | facility_score | hotel_grade | is_photo | location_score | text_word_count | title_word_count | value_for_money_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Average_Rating | 1.000 | -0.021 | -0.031 | 1.000 | -0.300 | 0.305 | -0.028 | 0.922 | 0.889 | 0.845 | 0.948 | 0.492 | 0.086 | 0.180 | -0.047 | 0.025 | 0.719 |
| Deviation of star ratings | -0.021 | 1.000 | -0.035 | 0.192 | -0.120 | -0.028 | 0.048 | 0.011 | 0.004 | 0.036 | 0.008 | 0.129 | 0.049 | -0.070 | 0.035 | -0.075 | 0.011 |
| Helpfulness | -0.031 | -0.035 | 1.000 | 0.032 | -0.002 | -0.078 | 0.048 | -0.018 | -0.017 | -0.037 | -0.024 | 0.010 | 0.021 | -0.007 | 0.129 | 0.014 | -0.016 |
| Hotel_Name | 1.000 | 0.192 | 0.032 | 1.000 | 1.000 | 0.167 | 0.168 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.144 | 1.000 | 0.042 | 0.065 | 1.000 |
| Num_of_Ratings | -0.300 | -0.120 | -0.002 | 1.000 | 1.000 | -0.078 | -0.048 | -0.357 | -0.241 | -0.303 | -0.316 | 0.592 | 0.110 | 0.344 | 0.017 | 0.026 | -0.363 |
| Rating | 0.305 | -0.028 | -0.078 | 0.167 | -0.078 | 1.000 | -0.023 | 0.288 | 0.287 | 0.272 | 0.296 | 0.129 | 0.080 | 0.081 | -0.199 | -0.006 | 0.212 |
| Time_lapsed | -0.028 | 0.048 | 0.048 | 0.168 | -0.048 | -0.023 | 1.000 | 0.001 | 0.009 | 0.036 | -0.030 | 0.098 | 0.026 | 0.027 | -0.053 | -0.048 | 0.057 |
| cleanliness_score | 0.922 | 0.011 | -0.018 | 1.000 | -0.357 | 0.288 | 0.001 | 1.000 | 0.961 | 0.850 | 0.951 | 0.531 | 0.082 | 0.114 | -0.043 | 0.018 | 0.756 |
| comfort_score | 0.889 | 0.004 | -0.017 | 1.000 | -0.241 | 0.287 | 0.009 | 0.961 | 1.000 | 0.821 | 0.943 | 0.468 | 0.060 | 0.133 | -0.036 | 0.023 | 0.664 |
| employee_friendliness_score | 0.845 | 0.036 | -0.037 | 1.000 | -0.303 | 0.272 | 0.036 | 0.850 | 0.821 | 1.000 | 0.821 | 0.394 | 0.097 | 0.144 | -0.057 | 0.021 | 0.704 |
| facility_score | 0.948 | 0.008 | -0.024 | 1.000 | -0.316 | 0.296 | -0.030 | 0.951 | 0.943 | 0.821 | 1.000 | 0.654 | 0.088 | 0.096 | -0.041 | 0.016 | 0.704 |
| hotel_grade | 0.492 | 0.129 | 0.010 | 1.000 | 0.592 | 0.129 | 0.098 | 0.531 | 0.468 | 0.394 | 0.654 | 1.000 | 0.053 | 0.412 | 0.018 | 0.024 | 0.331 |
| is_photo | 0.086 | 0.049 | 0.021 | 0.144 | 0.110 | 0.080 | 0.026 | 0.082 | 0.060 | 0.097 | 0.088 | 0.053 | 1.000 | 0.072 | 0.089 | 0.109 | 0.075 |
| location_score | 0.180 | -0.070 | -0.007 | 1.000 | 0.344 | 0.081 | 0.027 | 0.114 | 0.133 | 0.144 | 0.096 | 0.412 | 0.072 | 1.000 | -0.005 | 0.042 | 0.011 |
| text_word_count | -0.047 | 0.035 | 0.129 | 0.042 | 0.017 | -0.199 | -0.053 | -0.043 | -0.036 | -0.057 | -0.041 | 0.018 | 0.089 | -0.005 | 1.000 | 0.218 | -0.038 |
| title_word_count | 0.025 | -0.075 | 0.014 | 0.065 | 0.026 | -0.006 | -0.048 | 0.018 | 0.023 | 0.021 | 0.016 | 0.024 | 0.109 | 0.042 | 0.218 | 1.000 | 0.016 |
| value_for_money_score | 0.719 | 0.011 | -0.016 | 1.000 | -0.363 | 0.212 | 0.057 | 0.756 | 0.664 | 0.704 | 0.704 | 0.331 | 0.075 | 0.011 | -0.038 | 0.016 | 1.000 |
Missing values
Sample
| Hotel_Name | Review_Text | Posted_Date | Rating | Average_Rating | Num_of_Ratings | Helpfulness | is_photo | review_title | hotel_grade | employee_friendliness_score | facility_score | cleanliness_score | comfort_score | value_for_money_score | location_score | Crawled_date | title_word_count | text_word_count | Deviation of star ratings | Time_lapsed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | studios2let | Perfect location with good connections and shops and pubs | 2024-05-01 | 10.0 | 7.6 | 11670 | 0 | 0 | Exceptional | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 1 | 9 | 2.4 | 215 |
| 1 | studios2let | The room had everything you needed Near to amenities, was good room for price just needs little updatingThe bed was so hard it felt like sleeping on a hard floor, you had to make sure you had something on your feet as flooring pinched you feet needs changing | 2024-12-02 | 8.0 | 7.6 | 11670 | 0 | 0 | Very good | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 2 | 48 | 0.4 | 0 |
| 2 | studios2let | Conveniently nearby St Pancras, very small but clean and pleasant room first floor with small balcony to street side Interesting areaLuggage service can be improved by offering to lock luggage up instead of it just being put into the hall with all risks on the guests | 2024-12-01 | 8.0 | 7.6 | 11670 | 0 | 0 | Convenient location | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 2 | 46 | 0.4 | 1 |
| 3 | studios2let | Reception staffed 24 hours a dayAll good | 2024-12-01 | 9.0 | 7.6 | 11670 | 0 | 0 | Peaceful position in an elegant street close to 3 major stations and the Bloomsbury area | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 15 | 7 | 1.4 | 1 |
| 4 | studios2let | Very convenient to Kings Cross and the cityA little dated could do with a lick of paint | 2024-11-30 | 8.0 | 7.6 | 11670 | 0 | 0 | Great little gem in the city centre | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 7 | 17 | 0.4 | 2 |
| 5 | studios2let | Located in a quiet area but close to Kings Cross station so getting around was easy Several little pubs nearby for dining and some good coffee shops tooThere is no lift so dragging a heavy suitcase up and down stairs was challenging We had booked a room with terrace but the outdoor space was really minuscule not what we had expected from the photos | 2024-11-30 | 7.0 | 7.6 | 11670 | 0 | 0 | Convenient, quiet location | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 3 | 65 | 0.6 | 2 |
| 6 | studios2let | Its spacious, good value and so very quiet for LondonYou sometimes have to wriggle the loo flusher to stop it running and running | 2024-11-30 | 9.0 | 7.6 | 11670 | 0 | 0 | Superb | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 1 | 23 | 1.4 | 2 |
| 7 | studios2let | LocationLot of stairs bad knee | 2024-11-29 | 9.0 | 7.6 | 11670 | 0 | 0 | Ideal location for travelling round | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 5 | 5 | 1.4 | 3 |
| 8 | studios2let | Location was great, so near the stationWe were on the top floor, six flights of stairs and no lift\r\nHeating was on 247 full temperature and no means of reducing it | 2024-11-29 | 7.0 | 7.6 | 11670 | 0 | 0 | Perfect location, | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 2 | 31 | 0.6 | 3 |
| 9 | studios2let | The location which is excellent for public transport and local dining \r\nFriendly staffed reception where we could leave our travel bags all day after checking outThe climb up 3 flights of stairs was exhausting but it was our choice\r\nIt was a small room and the kitchen facilities were very sparse but we didnt need them | 2024-11-28 | 8.0 | 7.6 | 11670 | 0 | 0 | Ideal accommodation for a short stay in London near St Pancreas station | 3 | 8.3 | 7.5 | 7.9 | 7.8 | 7.6 | 9.3 | 2024-12-02 | 12 | 57 | 0.4 | 4 |
| Hotel_Name | Review_Text | Posted_Date | Rating | Average_Rating | Num_of_Ratings | Helpfulness | is_photo | review_title | hotel_grade | employee_friendliness_score | facility_score | cleanliness_score | comfort_score | value_for_money_score | location_score | Crawled_date | title_word_count | text_word_count | Deviation of star ratings | Time_lapsed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 105024 | montanahotel | Perfect Location | 2023-09-02 | 10.0 | 7.8 | 6248 | 0 | 0 | Lovely staff | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 2 | 2 | 2.2 | 471 |
| 105025 | montanahotel | It was good enoughDelayed check in, cracked basin in bathroom, water pressure poor Everything else was fine | 2023-07-11 | 6.0 | 7.8 | 6248 | 0 | 0 | Not a bad place to stay if its where you need to be | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 13 | 17 | 1.8 | 524 |
| 105026 | montanahotel | locationhot room, shower didnt drain, broken sink | 2023-07-01 | 5.0 | 7.8 | 6248 | 0 | 0 | Great location and generally clean spot but the place is a bit a dated and the basement room was damp, hot and a bit mus | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 25 | 7 | 2.8 | 534 |
| 105027 | montanahotel | Good to have teacoffee and a fridge in the roomThe building is beautiful but the interior decor leaves a lot to be desired The hotel is Indian in styleredgoldfaded wallpaper and threadbare carpetsthe room was OK but again needed updating We were in the basement with no viewonly rubbish out of the window There was no hot breakfast so only cereals, fruit and pastriesbut it was in a pleasant locationnear the tube and shopspubs etconly a short walk to the Natural History museum | 2023-04-25 | 6.0 | 7.8 | 6248 | 0 | 0 | Lovely building with quite a grand entrancelet down by the interiorfine for overnight stay | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 14 | 83 | 1.8 | 601 |
| 105028 | montanahotel | locationwater pressure was non existent\r\ndespite several request to address the problem | 2023-03-25 | 3.0 | 7.8 | 6248 | 0 | 0 | while the staff was nice They did very little to remedy the lack of shower and hot water problem we had | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 22 | 12 | 4.8 | 632 |
| 105029 | montanahotel | Convenient and classy The staff are excellent people, and Light of India is a fantastic restaurant I would certainly stay againNA | 2022-12-28 | 10.0 | 7.8 | 6248 | 0 | 0 | Highly recommend this little gem situated in my favourite part of town | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 12 | 21 | 2.2 | 719 |
| 105030 | montanahotel | lovely atmosphere, extremely friendly and helpful staff | 2022-07-01 | 10.0 | 7.8 | 6248 | 0 | 0 | Perfect location for our visit to the Royal Albert Hall and the Natural History Museum would | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 16 | 7 | 2.2 | 899 |
| 105031 | montanahotel | It was a single room, a little small but it was fine for 1 person, it had everything I needed | 2022-06-28 | 10.0 | 7.8 | 6248 | 0 | 1 | The staff were very friendly and helpful The position was perfect for sightseeing | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 13 | 20 | 2.2 | 902 |
| 105032 | montanahotel | Very clean and well maintainedThe rooms are very nice and comfortable with staffs professionalismThe food are delicious,nice breakfast,lunch ,dinner and the cocktails are exceptionalNotting much just that theres no parking | 2022-02-16 | 10.0 | 7.8 | 6248 | 0 | 1 | Myself and my wife really enjoy our stay at this hotel,we love the service and all the staffs are amazingLooking forwar | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 21 | 30 | 2.2 | 1034 |
| 105033 | montanahotel | The staff were very friendly and helpful Especially Kampas The hotel was very clean and the fact that they had a wonderful Indian restaurant as part of it was amazing Best Vindaloo everShower a tad small but adequate xx | 2022-02-06 | 10.0 | 7.8 | 6248 | 0 | 1 | Loved every minute we will be back Xxx | 3 | 9.0 | 7.7 | 8.2 | 8.2 | 8.0 | 9.4 | 2024-12-16 | 8 | 39 | 2.2 | 1044 |
Duplicate rows
Most frequently occurring
| Hotel_Name | Review_Text | Posted_Date | Rating | Average_Rating | Num_of_Ratings | Helpfulness | is_photo | review_title | hotel_grade | employee_friendliness_score | facility_score | cleanliness_score | comfort_score | value_for_money_score | location_score | Crawled_date | title_word_count | text_word_count | Deviation of star ratings | Time_lapsed | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 65 | park-plaza-county-hall | Brilliant location on the South Bank opposite County Hall so great for events there I was upgraded to a room with an incredible view over Waterloo Station and to the Shard beyond Rooms are soundproof so no sound from the trains The staff I encountered were all very friendly from my welcome at check in to smiles from any other staff member I sawWould be good to offer glass bottles of water rather than the small plastic bottles | 2024-11-11 | 10.0 | 8.6 | 11045 | 0 | 0 | Great location, friendly staff and amazing views | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 7 | 80 | 1.4 | 21 | 213 |
| 66 | park-plaza-county-hall | Free upgrade, Excellent customer service, beautiful room | 2024-11-10 | 10.0 | 8.6 | 11045 | 0 | 0 | Wonderful staff | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 2 | 7 | 1.4 | 22 | 213 |
| 67 | park-plaza-county-hall | Friendly staff Excellent serviceNothing | 2024-11-11 | 9.0 | 8.6 | 11045 | 0 | 0 | From every perspective a highly satisfactory stay | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 7 | 4 | 0.4 | 21 | 213 |
| 68 | park-plaza-county-hall | Great location, excellent attentive staff and great customer careNothing it was perfect | 2024-11-10 | 10.0 | 8.6 | 11045 | 0 | 1 | From start to finish our stay was perfect we were well looked after and had a fabulous room with a view Would definitel | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 23 | 12 | 1.4 | 22 | 213 |
| 70 | park-plaza-county-hall | LocationNo parking place in front | 2024-11-10 | 8.0 | 8.6 | 11045 | 0 | 0 | It was good only faced problems in getting taxi | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 9 | 5 | 0.6 | 22 | 213 |
| 74 | park-plaza-county-hall | The hotel was ideally located for us We were upgraded to a studio room and given a bottle of wine Nice touch Breakfast was plentiful and good choicesThere was a mouse running around on Thursday evening in the bar Very tiny but a bit unsettling with food being around Didnt like they took 310 pending transaction without telling me I didnt owe a penny at all A bit much to check my card worked Said they have reversed it but not seen it yet | 2024-11-10 | 9.0 | 8.6 | 11045 | 1 | 0 | Superb | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 1 | 84 | 0.4 | 22 | 213 |
| 77 | park-plaza-county-hall | The reception area, the lift views and our room was exceptional | 2024-11-07 | 10.0 | 8.6 | 11045 | 0 | 0 | We stayed with our daughter as we had tickets to ABBA voyage | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 12 | 11 | 1.4 | 25 | 213 |
| 81 | park-plaza-county-hall | Very close to some of the attractions like the London Eye, Westminister Abbey etc Rooms were spacious and neat I took the room which had a sofacum bed which gave us the extra space, especially with a kid Perfect for a family of 3We landed a bit early around 1030 While the checkin time was 1500 hrs, we requested if we could get the room a bit earlier I was ready to pay for an extra day too A person at reception counter mentioned that the room will get cleaned, but was very rude while he was talking When I tried checking after an hour to get an ETA, he simply denied to give an ETA He said it could take 30 mins or 2 hours, which I felt very rude However another wonderful lady at the reception was more helpful and got the room ready in 3040 mins | 2024-11-09 | 8.0 | 8.6 | 11045 | 0 | 0 | Neat comfortable place with major attractions nearby | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 8 | 149 | 0.6 | 23 | 213 |
| 83 | park-plaza-county-hall | Very good hotel, spacious room with a great view Look forward to staying here again in future | 2024-11-11 | 10.0 | 8.6 | 11045 | 0 | 0 | Exceptional | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 1 | 17 | 1.4 | 21 | 213 |
| 85 | park-plaza-county-hall | location is great, quite, safe and close to restaurants, coffee shops, close to London eye easy to reach\r\nfast check in out, great front desk team\r\nroom is spacious and beds are comfortablethe cleaning service is not good comparing it to my previous stay\r\nsome furniture need to be renewed | 2024-11-10 | 8.0 | 8.6 | 11045 | 0 | 0 | Great Stay | 4 | 9.0 | 8.7 | 8.8 | 8.9 | 7.9 | 9.5 | 2024-12-02 | 2 | 52 | 0.6 | 22 | 213 |